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Abstract. In a previous paper, we have demonstrated the 
importance to define a statistical model describing the ob- 
served linear correlation between the absolute magnitude 
M and the log line width distance indicator p of galax- 
ies (the Tully-Fisher relation, Tully&Fisher (1977)). As 
long as the same statistical model is used during the cal- 
ibration step of the relation and the step of the deter- 
^£ mination of the distances of galaxies, standard statistical 



methods such as the maximum likelihood technic permits 
us to derive bias free estimators of the distances of galax- 
! i ies. However in practice, it is convenient to use a different 
O statistical model for calibrating the Tully-Fisher relation 
(because of its robustness, the Inverse Tully-Fisher rela- 
tion is prefered during this step) and for determining the 
distances of galaxies (the Direct Tully-Fisher relation is 
more accurate and robust in this case). Herein, we estab- 
lish a correspondence between the Inverse and the Direct 
Tully-Fisher approaches. Assuming a gaussian luminos- 
ity function, we prove that the ITF and DTF models are 
in fact mathematically equivalent (i.e. they describe the 
^ same physical data distribution in the TF diagram). It 
thus turns out that as long as the calibration parameters 
t^h are obtained for a given model, we can deduce the corre- 
'"p^sponding parameters of the other model. We present these 
O formulae of correspondence and discuss their validitity for 
+3 non-gaussian luminosity functions. 
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1. Introduction 

Recently, the number of theoretical works on the Tully- 
Fisher (TF) like relation used to infer redshift independent 
estimators of distance of galaxies has increased rapidly 
(Hendry et al. (1994), Willick (1994), Sandage (1994), 



Send offprint requests to: S. Rauzy 

* the European Cosmological Network 



Teerikorpi (1993), Bicknell (1992), Landy et al. (1992), 
Fouque et al. (1990), Teerikorpi (1990), Hendry et al. 
(1990)). In a previous paper (Triay et al. (1994), hereafter 
TLR) we have demonstrated the importance to define a 
statistical model describing this observed linear correla- 
tion between the absolute magnitude M and the log line 
width distance indicator p of galaxies. A random variable 
C = ap + b — M of zero mean was introduced to mimic 
the intrinsic scatter of the TF relation. In order to fully 
specify the statistical model, a second random variable £ 
of mean £o and dispersion a^, statistically independent of 
C, has to be chosen. The Inverse TF relation (ITF) or the 
Direct TF relation (DTF) appears indeed as a peculiar 
choice of this variable £. As long as the selection effects in 
observation are taken into account, we have shown that 
the introduction of such a statistical model permits to 
derive unbiased statistics for distance of galaxies as well 
as for the Hubble's constant by using standard statistical 
methods like the maximum likelihood technic (TLR and 
Triay et al. (1995), hereafter TRL). In this present paper, 
we focus on the infuence of the statistical model chosen 
to mimic the genuine TF relation in the M-p plane. The 
problems of biases related to selection effects in obser- 
vation, measurement errors or the spatial distribution of 
sources are investigated in TLR and TRL. 

In section 2, we generalize some results obtained in 
TLR by introducing a class of statistical models indexed 
by an angle parameter a. This class of a-models forms 
a continuous set of models including the ITF and DTF 
approaches as boundary cases. We derive the maximum 
likelihood statistics for the 5 model-dependent parameters 
a a , b a , a", an d characterising an a-model and we 
illustrate their variations with respect to the angle param- 
eter a. Assuming standard working hypothesis (a gaussian 
luminosity function), we prove in section 3 that all these 
a-models are indeed mathematically equivalent : i.e. they 
describe the same physical data distribution in the M-p 
plane. It thus turns out that as long as the 5 parameters 
a a , b a , a" , an d are known for a given a-model (say 
the ITF model for example), we can deduce the corre- 



for the DTF model). These formulae of correspondence 
are derived in section 4. This property permits indeed to 
use a different statistical model for calibrating the TF re- 
lation and for determining the distance of galaxies or the 
Hubble's constant. In practice, the best suitable model will 
thus be chosen with regard to the selection effects affecting 
the samples during these 2 steps. We analyse the general 
case (i.e : non-gaussian luminosity functions) in section 5. 
We show that the TF relation remains fully characterized 
by the knowledge of the one and second order moments in 
M and p, but that it is no longer possible to describe the 
data with the ITF and DTF models simultaneously. This 
implies that it doesn't exist formulae of correspondence for 
the general case. However in practice, the ITF and DTF 
descriptions are found to be sufficiently accurate approx- 
imations as long as the correlation coefficient of the TF 
relation is close to 1, which is the case for the real data. 
Standard notations and useful formulae used throughout 
the text are given in appendix A. 

2. The set of the a-models 

Herein, we specify the theoretical probability density (pd) 
describing the distribution of variables involved in the TF 
relation. These variables are related to intrinsic quantities 
of sources (galaxies), which are : 

— the absolute magnitude M , 

— the log line width distance estimator p = logVF. 

Regardless of selection effects in observation, measure- 
ment errors and of the distribution of sources in space, the 
theoretical pd describing the distribution of these variables 
in the M-p plane can be written : 



dP th = F(M,p) dMdp 



(1) 



The observed linear correlation between M and p (the 
TF relation) constrains the probability density function 
(pdf) F(M,p) to adopt a specific form. In fact, it exists a 
straight line Atf of equation M(p) = ap + b such that 
the data in the M-p plane are distributed about this line. 
The slope a and the zero point b of this line are unknown 
quantities which will be estimated during a preliminar cal- 
ibration step. In TLR we have shown that it is convenient 
to express this intrinsic scatter about the line Atf by 
introducing a random variable £ of zero mean and of dis- 
persion <tq defined as follows : 



( = M(p) - M = ap + b - M 



(2) 



A second random variable £ statistically independent of £ 
is required in order to fully specify the statistical model 
(i.e. the pdf F(M,p)) characterizing the data distribution 
in the M-p plane 1 . 



1 In the absence of a better physical understanding of the 
TF relation, the parameters a and b have to be determinated 



introducing a set of models characterized by the choice 
of this second variable £. We define a family of model 
dependent variables indexed by an angle parameter a 
varying continuously from to tt/2 : 



£ a = cos aM + sin a a a p 



(3) 



where we rewrite the Eq. (2) as follows (£ is model depen- 
dent, see footnote 1) : 



( a = M a (p) - M = a a p + b a - M 



(4) 



The random variable is a linear combination of M and 
p and is statistically independent of the random variable 
characterizing the TF relation : 



cov(r,o = o 



(5) 



see Def. 4 in the appendix A. We have thus introduced a 
set of statistical a-models describing the TF diagram by 
the following pd : 



dP th « dP t a h = f^(i a n a g(C;Q, <K C 



(6) 



In order to entirely characterize an a-model, we need to 
specify the pdf g(( a ; 0, a") and f^(^ a ). The distribution 
of the random variable ( a is chosen gaussian and we limit 
ourselves in this section to the case of a gaussian distribu- 
tion for ^ a (the general case is discussed section 5). Our 
working hypothesis are then a gaussian pdf of zero mean 
and of dispersion a" for the random variable ( a charac- 
terizing the intrinsic scatter about the straight line A TF 
and a gaussian pdf of mean £q an d of dispersion for 
the second random variable ^ a . Note that our a-models 
are based on the independence of ( a and , which means 
in particular that a" doesn't depend on , and and 
cr« on C" : 

g(C;0,*?) = g G (C;0,a?) (7) 

fe*(0 = 9 G (C;^,«l) (8) 

see Def. 1 in the appendix A. Finally, the pd describing 
an a-model reads as follows : 



dP f a h = gaiC^o, <Tf K° 9g(C; 0, a a c )dC 



(9) 



Note that the set of the a-models describes the Direct TF 
relation (p and £ are statistically independent) and the 
Inverse TF relation (M and £ are statistiscally indepen- 
dent) when the angle parameter a is equal to its boundary 
values : 







ITF 



g G (M; M , a M )dM ^(C 1 ; 0, a^d? 



(10) 



using a statistical process (the calibration step). Thus, these 
parameters a and b and so the random variables £ and £ depend 
on the statistical model used to describe the data distribution 
in the M-p plane : they are model dependent. 



DTF : < (11) 
[dP» = 9 g(p;po, ° P )dp 9g(C d ;0, ^)d( D 

The next step of the analysis is to derive the 5 model 
dependent parameters a a , b a , a", £q and characteris- 
ing an a-model from a calibration sample (containing N 
galaxies with observed M and p, {Mk,pk} k=1 N ). Herein, 
we use the maximum likelihood technic (see Def. 3 in the 
appendix A) to derive these statistics which are presented 
in the appendix B. We illustrate in Fig. 1 the variation of 
these 5 model dependent parameters for a varying contin- 
uously from to 7r/2. The simulated calibration sample is 
chosen large enough in order that averaged quantities are 
close to their expected values. 

In particular the general statistics of appendix B for 
the ITF model (a = 0) reads (with the correlation coeffi- 
cient p(p, M) defined in Def. 4 of the appendix A) : 



'C 



E(M) 2 
Cov(p, M) 

(M) — ^ 
£(M) 2 



Gov(p, M) 
1 



1 



e 



(M) 
a} 2 = £(M) 2 

and for the DTF model (a = tt/2) 
d _ Cov(p,M) 



& D = (M) 



Cov(p, M) 
S(p) 2 



so 



E(M) 2 (l-p(p,Mf) 
a° 2 S(p) 2 = p(p, M) 2 S(M) 2 



(12) 
(13) 

(14) 

(15) 
(16) 

(17) 

(18) 
(19) 
(20) 
(21) 



3. Equivalence of the a-models 



In substituting the general statistics of the model depen- 
dent parameters a a , b a , a", £q and in the pd of Eq. 
(9), we find that, for every a belonging to [0,7r/2] (see 
appendix C for detailed calculations) : 

Va £ [0,tt/2] : dP« h = 
g G (M; (M) + (p - (p)) , E(M)yT^7) (22) 

xgaip; (p),S(p)) dMdp 

It thus means that all the a-models are indeed mathe- 
matically equivalent. They describe the same physical dis- 
tribution of data in the M-p plane. Note that, by using 



rewrite Eq. (22) as follows : 
Va £ [0,tt/2] : dP« h = 

9G(p; (P) + ^ffef 1 ( M ~ < M » > mV^?) (23) 



xg G (M; (M), S(M)) dMdp 



The symmetric expression in M and p of Eq. (22, 23) reads 
finally as a binormal pdf'ui M and p, entirely characterized 
by its 5 moments of first and second order (p), (M), £(f>), 
E(M) and Cov(p, M) : 



Va £ [0,tt/2] : dP { 



th 



2tt£(M)£(p)^1- p(p,M) 2 



exp 



2(l-p( v ,M) 2 ) 



x ( (P-(P)) 2 _ 2 Cov(p,M)(p-{p))(M-{M)) (M-{M)f \ 1 
A ^ £(p) 2 z £(M) 2 £(p) 2 ^ S(M) 2 J j 



(24) 



x dMdp 

We now understand that our working hypothesis (2 gaus- 
sian pdfioi ^ a and ( a ) imply that the knowledge of the 5 
parameters a a , b a , a", £q and for a given a-model is 
sufficient to entirely describe the data distribution of the 
TF diagram 2 . 

The statistical problem underlying the Tully-Fisher re- 
lation is sometimes expressed in the literature in terms 
of conditional probability (see for example Hendry et al. 
(1994) or Lynden-Bell et al. (1988)). The Direct Tully- 
Fisher approach consists in defining for a given_ value of 
the parameter p, a mean absolute mag nitude M D (p) de- 
pending linearly on p : 



M D (p) = a D p + b D 



(25) 



where a D and & D (the slope and the zero point of the 
Direct Tully-Fisher line) correspond in our formalism to 
a a and b a with a = 7r/2 (see Eq. (17, 18)). The conditional 
pd of M given p takes then the following form : 



dP th (M\p) = P th (M\p) dM 

= g G (M;M D (p),af) dM 



(26) 



The distribution of the M's is centered on M D (p) and of 
dispersion crP. Finally, the pd in the M-p plane reads : 



dPth = ga(p; (p), S(p))P th (M|p) dMdp 



(27) 



which is Eq. (22) when substituting the calibration pa- 
rameters a D , & D and crP by their estimates given in Eq. 



2 Weaker hypothesis on the 2 pdf oblige us to take into ac- 
count the higher order moments of the bivariate distribution in 
M and p. Thus, the a-models are no longer strictly equivalent. 
This case is developed in section 5. 



Fig. 1. Variations of the 5 model dependent parameters a a , b a , a", £g and with respect to the angle parameter a. 



is based on the introduction of a mean parameter p l (M) 
for a given value of M : 



p^M) 



7 M 



(28) 



with a 1 and & 1 the splope and the zero point of the Inverse 
Tully-Fisher line (see Eq. (12, 13)). The conditional pd of 
p given M is expressed as follows : 



dP th (p\M) = P tb (p\M) dp 

= gG(p;p I (M),a 1 c /a 1 ) dp 



(29) 



And the pd describing the data in the M-p plane is derived 
from Eq. (23) by using Eq. (12, 13, 14). 



dPtb = g G (M- (M), S(M))P t h(p|M) dMdp 



4. Correspondence between the a-models 



(30) 



We have shown in section 3 that, with our standard work- 
ing hypothesis, all the a-models are indeed mathemati- 
cally equivalent. It thus turns out that if the 5 calibration 
parameters which describe the data distribution in the M- 
p plane are known for a given a-model, we can deduce the 
corresponding 5 parameters for every a-models. We derive 
in appendix D these general formulae of correspondence. 
Herein, we present these formulae when the angles a and 
[3 are equal to their boundary values. If the calibration 
parameters of the Inverse Tully-Fisher relation are known 
and we want to infer the calibration parameters of the 
Direct Tully-Fisher relation (a = and [3 = tt/2) : 



i^i I z 



\ i i 
p a 



b D 



i^r ^ , „i 4 ~ i 2 _ „i 2 i 2 



i - p L ^ + p L = p L *, 



„i 4 i _i 



p K + < T c )=p °\ 



(31) 

(32) 
(33) 
(34) 
(35) 



Or conversely if the calibration parameters of the Direct 
Tully-Fisher relation are known and we want to deduce the 
calibration parameters of the Inverse Tully-Fisher relation 
(a = tt/2 and /? = 0) : 



l 



& 1 

e 



p u 



(36) 

(37) 
(38) 



1 



D2 



(39) 
(40) 



5. Generalization and discussion 



Assuming that the M-p distribution is a binormal pdf, 
we have outlined in section 3 that the ITF and DTF ap- 
proaches are indeed equivalent and that the Tully-Fisher 
relation in this special case is entirely characterized by the 
five moments (of first and second order) (p), (M), £(f>), 
E(M) and Cov(p, M). However such a property vanishes 
if the luminosity function is not gaussian. We show in 
the appendix E that the ITF and DTF approaches are 
no longer strictly equivalent for general luminosity func- 
tions. In order to preserve this equivalence, the luminosity 
function /m(M) of the analysed galaxies has to verify the 
following equation : 



f M (M) « frec(M) 



f M {x) Wm{x) dx 



(41) 



with p = p(M, p) the correlation coefficient of the data 
distribution and Wm(%) given by (see appendix E) : 



Wm(x) 



fro (x; m + ^(m - (m)), ^yr^7 



(42) 



This condition is achieved when the luminosity function is 
gaussian but is ruled out in general. The errors introduced 
by describing indifferently the data distribution with the 
ITF or DTF models (or with some a-model) can be quan- 
tified by using a Kolmogorov-Smirnov test as test of good- 
ness of fit. The amplitude of these errors on the distance 
estimates of galaxies depends on the characteristics of the 
galaxies population (luminosity function, correlation co- 
efficient of the TF relation, etc ...). Hence, we do not es- 
timate numerically the amplitude of the errors and only 
speculate qualitatively on their general behaviour. We use 
the discrepancy between /m(M) and f rec (M) as a mea- 
sure of the errors introduced by describing silmutaneously 
the data distribution with the ITF and DTF models. Eq 
(42,41) show that this discrepancy is negligible as long 
as the correlation coefficient of the TF relation is close 
to 1 (which is the case for the real data). We illustrate 
this feature (Fig. 2) by plotting /m(M) and f rec (M) for a 
Schechter luminosity function. We see that the condition 
of equation (41) is poorly verified for small correlation 
coefficient (p = —0.85). In this case, it becomes danger- 
ous to use a prion a model to describe the Tully-Fisher 
relation. A bad choice of this model can create spurious 
phenomena such as the variation with M or p of the slope 
or the dispersion of the Tully-Fisher relation. Similar ef- 
fects may also appear when calibrating the TF relation 



Fig. 2. Comparison between /m(A^) and f r ec(M) for a Schechter luminosity function. 



in distant clusters since the apparent magnitude cut-off 
favours the selection of high absolute magnitude galaxies 
for which the discrepancy between /m(M) and f rec (M) 
is large. This discrepancy however decreases when adopt- 
ing for the correlation coefficient a more realistic value 
p = —0.95 (except for the high magnitudes, which is not 
a prejudice since the Schechter high magnitude cut-off is 
not physical). Because the f rec (M) curve for p = —0.95 
match well the Schechter function, it seems that the ITF 
and DTF models (and so the set of the a-models) are close 
to be equivalent for the real observed data. It follows that 
the formulae of correspondence derived in section 4 ap- 
pears as fair approximations even for non-gaussian lumi- 
nosity functions 3 . 

6. Conclusion 

In order to mimic the Tully-Fisher diagram, we have in- 
troduced a continuous set of statistical models character- 
ized by the straight line A^ F describing the observed lin- 
ear correlation of M and p. This set of a-models includes 

3 Note however that if the shape of the luminosity function is 
of no importance during the calibration step, it can be decisive 
for the bias correction procedure involved in the determination 
of the distances of galaxies (see Bicknell (1992)). 



the ITF and DTF relation as boundary cases. Assuming 
a gaussian luminosity function, we have shown that all 
these a-models describe indeed the same physical data 
distribution in the M-p plane. Thus, if the 5 calibration 
parameters a a , b a , a", £q and are known for a given 
a-model, we can infer the calibration parameters of every 
a-models by using formulae of correspondence. We have 
shown that this property remains valid for non-gaussian 
luminosity functions as long as the correlation coefficient 
of the TF diagram is close to 1 (which is the case for 
the real data). In practice such a property allows us the 
possibility to use a different statistical model during the 
calibration step of the TF relation and for determining the 
distances of galaxies. The best suitable statistical model 
will thus be chosen with regard to the selection effects in 
observation affecting the samples during each of these 2 
steps. 

For example, the ITF model seems to be more ade- 
quate for calibrating the TF relation because of its ro- 
bustness (the estimates of a 1 , & 1 and it* do not depend on 
the luminosity function (Hendry et al. (1990), TLR) but 
also because when calibrating the ITF relation in a cluster, 
the estimates of a 1 and aj- don't depend on the distance of 
the cluster (Schechter (1980), Tully (1988), Lynden-Bell et 
al. (1988),Teerikorpi (1990), Hendry et al. (1990), Rauzy 



preferred to determine the distances of galaxies. It is more 
accurate (the intrinsic scatter of the DTF relation af is 
indeed smaller than the ITF one a\ (Tully (1988), TLR)), 
more robust (the DTF distance estimator doesn't depend 
on the luminosity function (TLR)) and more intuitive (an 
observed p gives directly a value of M : M(p) = a°p + & D 
(Bottinelli et al. (1986), Fouque et (al. 1990))). 



A. Notations and useful formulae 

The mathematical formalism is similar to the one used in 
Bigot & Triay (1990). The following features are addressed 
throughout the text by using the symbol "Def." . 

Def.l The probability density (pd) of a random variable x 
reads dP(x) = f(x)dx, where f(x) represents the pd 
function (pdf), we have J dP(x) = 1. Sometimes, it 
is useful to exhibit the model parameters involved in 
the statistical model, as the mean xo and the standard 
deviation a, by writing f(x; xo, &)■ 
(&) g G (x;x ,cr) = (a^/^ir)- 1 exp - ({x - x )' 2 / {'la' 2 )) 

is a Gaussian pdf. 
(b) A normal pdf can be written gjsi(x) = g G (x;0, 1). 

Def.2 Let / be a pdf and A be a scalar value, in most of 
calculations, we use the following properties : 

(a) f(x + A; x , a) = f(x; x - A, a); 

(b) f(\x;x ,a) = \- 1 f(x;f,f); 

(c) g G (x; xi, a 1 )g G (x; x 2 , cr 2 ) = 

g G (x; x , a- )g G (xi;x 2 , &), where a = \/ o\ + a\, x 
and Co are defined as follows a^ 2 = <rj~ 2 + c^~ 2 and 

XoUq 2 = X\(T^ 2 + X2&2 2 ■ 

Def. 3 The pd of a sample data {Gk}j, = i jy, which consists 
of N independently selected objects Qj., is given by 
YlLldPiGk). 

(a) Its pdf written in terms of observables (the measur- 
able random variables), but regarded as a function 
of model parameters, provides us with the likeli- 
hood function. 

(b) ( The ML method.) The model parameters statistics 
are obtained by maximizing the likelihood function, 
or (equivalently) the natural logarithm of the effi- 
cient part of it, in which the terms which do not 
contribute to the determination of parameters are 
removed, herein briefly denoted by //. 

Def A We use the following usual definitions : 

(a) (x) = X^fcLi X k/N is the average, 

(b) Cov(x,y) = - (x))(y k - {y))/(N - 1) is 
the covariance, 

(c) £(«) = \J Cov(x, x) is the standard deviation, 

(d) p(x,y) = Cov(;c, y) / (T,(x)T,(y)) is the correlation 
coefficient. 



We have to express the pd of Eq. (9) in terms of the ob- 
servables M and p. In substituting Eq. (3,4) in Eq. (9), 
we obtain : 



dP t 



th 



g G (c a M + 8 a a a p;Zg,<r%) 



xg G (a a p+b a -M;0,(T«) 
x \a a (c a + s a )\ dMdp 



(Bl) 



where c a = cos a, s a = sin a and \a a (c a + s a )\ is 
the Jacobian of the coordinate transformation. The nat- 
ural logarithm of the likelihood function (If) C a = 
C a (a a , b a , a?, ^0 ; <?"?) as a function of the 5 parameters 



of the model reads : 
C a = +ln|a a | +ln(c° 



+ s a ) 



In a 



e n 



1 (c°« + '°">-g) 2 



k = l 



(B2) 



1_ v^AT (a a p + b a -M) 2 



2(7° 



Maximizing C a with respect to these 5 parameters furnish 
the following set of equations (see Def. 3 and Def. 4 of the 
appendix A) : 

-^2 (s a p(c a M + s a a a p - 
3 aa £ a = 0^1 ' (B3) 

(p(a a p + b a - M)) + £ = 



8 b cC a = => ((a a p + b a - M)) = 



+ 2^((a a p + b a - M) 2 ) 



d aa2 C a = 
< 



2(7 



^ = 



(B4) 
(B5) 



d^C" = => ((c a M + s a a a p - («)) = (B6) 

>-€o a ) 2 } 

(B7) 



+ 1 ^((c a M + s a a a p-CS) 2 ) 



d aa2 C a = 
i 



■2^ = 



Solving this set of equations provides the estimates of the 
5 model dependent parameters a a , b a , a" , an d a £ char- 
acterising an a-model. Eq. (B4) furnishes the statistic of 
b a : 

b a = (M) - a a (p) (B8) 
Substituting Eq. (B8) in Eq. (B5) gives : 
af = £(M) 2 - 2a a Cov(p, M) + a a2 £(p) 2 (B9) 
Eq. (B6) gives the statistic of £q : 

= c a (M) + s a a a (p) (BIO) 



v«2 



Herein we show, by substituting the general statistics of 
(Bll) ^ 6 m °d e l dependent parameters a a , b a , a" , £q and in 
the pd of Eq. (9), that all the a-models are indeed mathe- 
matically equivalent. For every a belonging to [0, tt/2] and 
by using the properties (a), (b) and (c) of the Def. 2 of 
a a (p-{p)) and Y = (M - (M)) (with these notations Eq. the a PP en dix A, we rewrite Eq. (9) as follows : 
(B9) and Eq. (Bll) rewrite af = ((X - Y) 2 ) and af - 



a'f = c a2 Y>(M) 2 + 2s a c a a a Cov(p, M) 



+ s a2 a a2 Z(p) 2 
In order to evaluate the a a statistic, we define X 



{(s a X - c a Y) 2 )). In substituting Eq. (B8,B9,B10,B11) in 
Eq. (B3), we obtain (with X and Y defined above) : 



dP t 



th 



xg G (M;a a p + b a ,a^) l a "( c J+ s ")l dMdp 



(CI) 



1 (X(X-Y)) ( 8 °X( 8 °X-c°Y)) _ Q 



(B13) 



((X-Y) 2 } ((s a X-c a Y) 2 ) 
which reduces to : 
C {s a X(s a X - c a Y)) {(X - Y) 2 ) 

{+ (Y(X - Y)) ((s a X - c a Yf) = 
and by developing and factorizing this equation 

C (s a £(X) 2 - c a £(Y) 2 + (c a - s a )Cov(X, Y)) 
\x (c« 2 S(Y) 2 + s« 2 S(X) 2 ) =0 
which rewrites : 

C (a a2 s a Z(p) 2 - c a S(M) 2 + a a (c a - s a )Cov(p, M)) 

\ x (c a2 £(M) 2 + a a2 s a2 Y<(p) 2 ) = 
which reduces to 4 : 

a a2 s a T 1 (p) 2 + a a (c a -s a )Cov(p,M)-c a T 1 (M) 2 = (B16) 



(B12) 

And by introducing the variables x , a x , y and a y , we 
express the pd of Eq. (9) as the product of 2 gaussians 
depending on M and p : 



dP t 



th 



) 



xg G (M;x ,a x ) |a ° (c e l +s ° )l dMdp 
dP" h = g G (p; ?/o, a y )g G (M; x , a x ) dMdp 



(C2) 
(C3) 



(B14) The variables xo, & x , Do and a y thus verify the following 
equalities : 



c a2 a a2 + ff a2 



(B15) 



x 



yo 



c a2 a a2 + ff a2 



c a (rf 2 - s a a a p) + af 2 (a a p + b a ) 
1 

a a2 (c a + s a ) 2 

- c a b a 
a a (c a + s a ) 



a2 a2 , a2 
C <7£ + (7 



(C4) 

(C5) 
(C6) 
(C7) 



The a a statistic is obtained straightforwardly if the co- 
efficient of the second order term of Eq. (B16) vanishes h Y replacing the statistics of a a , b a , <r«, and cr« derived 



if a = : a c 



S(M) 2 
Cov(p, M) 



in Eq. (B8, B9, BIO, Bll, B17, B18), we find that for every 
a belonging to [0, tt/2] 



(B17) 



or else the a a statistic is obtained by solving the second 
order equation of Eq. (B16) : 

if a ^ : a a = 
S% { (s a ~ c«)Cov(p, M) + sign [Cov(p, M)] (Blg) 

x y/(s a - c«) 2 Cov(p, M f + 4c«s«S(M) 2 S(p) 2 | 



S(M) 2 S(p) 2 - Cov(p,M) 2 
Z(pj~ 2 

E(M) 2 (l-p(p,M) 2 ) 
, , Cov(», M) . , 1X 



'y 

yo 



m 2 



(C8) 

(C9) 

(CIO) 

(Cll) 
(C12) 



And substituting Eq. (C8, CIO, Cll, C12) in Eq. (C3) 
gives Eq. (22) as required. 



We have thus derived the statistics providing an estimate 

of the 5 parameters a a , b a , a", £q and characterizing D. General formulae of correspondence 
an a model. The starting point is to assume that a", b a , a? , £q and a 



this equation can also be obtained by developing Eq. (5). are known for a given a-model. By inverting the system 



Cov(p, M) 

£(M) 2 
(M) 



c a2 a" 2 


+ of 2 


a a2 (c a ■ 


+ s a ) 2 


rv ? 

(7^ - C 


C 


a a2 (c a 

V 


+ S«) 2 

1 / 


2 ex. 2 


rv 9 


c 


+ °l 


(c« + 


s a f 


^ + 





minosity function for example). The theoretical probabil- 
ity density pd of a galaxy reads thus as follows : 



dP tb = f M (M)dM gaiC 1 ; 0, al)^ 1 



c a + s a 
& - c a b a 
a a (c a + s a ) 



(Dl) 
(D2) 
(D3) 

(D4) dP th = g p (p)dp g G (( D ;0, a^)d( D 



(El) 



Is it now possible to describe this same data distribution 
by using the DTF model ?. If so, the theoretical pd of Eq. 
(El) can also be written : 



(E2) 



(D5) where g p (p) is the distribution function of the p's and can 
be obtained by integrating over M the pdf of Eq. (El). In 

„ „ the same way, the luminosity function /m(M) is equal to 

And so we can deduce the 5 parameters a", 6", a ( , £„ and the integral over p of the pd f Q f Eq . ( E2 ). It means that if 

(Tj for every angle parameter (3 belonging to [0, 7r/2], by re- we want to describe simultaneously the data distribution 

placing the values of (p) , (M), S(M) and Cov(p, M) by Eq. (El) and Eq. (E2), the luminosity function has to 

given in Eq. (D5, D4, Dl, D3, D2), in Eq. (B8, B9, BIO, satisfy the following equation : 
Bll, B17, B18) applied to a /3-model. In the peculiar case 

13 = 0, Eq. (B17) gives : hAM)=L,JM)= I f*,(x\W*,(x\dx (E3) 



if /3 = 



s a2 a? 2 



'( 



T a2 



c a s a a" 2 



And for the general case, we deduce a 13 from Eq. (B18) 



a a x 



c a s a tj? 2 ], we obtain 



'C 



With sgn = sign [u^ 2 
if 13 ^0 : a? 

+sgn x [(s^ 3 - C P) 2 (af - c a s a af) 2 

1 1/2 "1 

+ 4c! 3 si 3 (s a2 af 2 + af){c a2 af + af)\ \ 
Finally, b 13 , , £q an d ^ are derived using Eq. (B8, B9, 



/Af(M) = frec(M) = j f M (x)W M (x)dx 

(D6) with W M {x) given by : 

Wm{x) = a 1 J dp ga(x — a l p — b l ; 0, <r*) 

xg G (M -a D p-b D ;0,af) 



(E4) 



(D7) 



By using the properties (a), (b) and (c) of Def. 2, this 
equation rewrites : 

Wm(x) = 4x 



g G [x-M + (^- 1)M + & 1 - ^& D , Jaf 2 + (£af) 



(E5) 



BIO, Bll). We set A 
b {3 = A a K«(l 



Because the random variables M and ( l are independent 
in Eq. (El), the statistics given in Eq. (12, 13, 14) of a 1 , & 1 
and al obtained by using the maximum likelihood technic 



c a -\-s c 



a? a? 
—) + b a (s a + c a — ) 
a a ' v a a ' 



in the appendix B remains valid. In the same way, p and 
C D are independent in Eq. (E2), and so the statistics given 
(D8) in Eq. (17, 18, 19) still furnish us the estimates of a D , & D 
and crP . It implies that the following equations holds : 



A 



(D9) 



(D10) 



<f_ _ S(M) 2 S(p) 2 



1 



a 



Cov(p, My 



p(p, M) 
1 \ 



A 



a2 



< V + s^-f + <T a c \cPs a - 8 Pc a ^-f ) (Dll) 



E. Non- Gaussian luminosity functions 



p(M, P y 



S(M) 2 



(M) 



p(M,p) 4 



(E6) 
(E7) 
(E8) 



Let us assume that the ITF model describes the data dis- Substituting the statistics of Eq. (E6, E7, E8) in Eq. (E5) 
tribution of some sample of galaxies characterized by a finally gives Eq. (42) as required. 
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